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^ Abstract 



In this paper, we introduce a family of second-order sigma delta quantization schemes 
for analog-to-digital conversion which are 'quiet': quantization output is guaranteed to 
fall to zero at the onset of vanishing input. In the process, we prove that the origin is a 
globally attractive fixed point for the related family of asymmetrically-damped piecewise 
affine maps. Our proof of convergence is twofold: first, we construct a trapping set using 
a Lyapunov-type argument; we then take advantage of the asymmetric structure of the 
^ r~| maps under consideration to prove convergence to the origin from within this trapping set. 

Key words: piecewise affine maps, attractivity, trapping set, Lyapunov function, analog- 
, ^ , to-digital conversion, sigma delta modulation, quiet quantization, idle tones 
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> 1 Introduction 
in 

Analog-to-digital conversion is the study of accurate and tractable methods for the approx- 

CN imation of real- valued signals using a finite alphabet. It is of great importance as many 

^ signals of interest such as audio, are naturally produced in analog form, while it is becoming 

O increasingly efficient to store and manipulate information in digital format. 

2 The process of analog-to-digital conversion usually consists of two parts: sampling and 

^ quantization. Sampling consists of converting the continuous-time signal of interest f{t) into a 

• i-H discrete-time signal f{tn)^ and quantization is the process of mapping the discrete-time signal 

^ f{tn) into a sequence of discrete values, in such a way that the original function f{t) can be 

^ reconstructed, albeit imperfectly, from these discrete values at a later time. 



Sampling. In the setting of analog-to-digital conversion, the signal of interest f{t) is often 
modeled as a bounded, bandlimited function. Specifically, the signal is assumed to belong 
to the space Bq of real- valued continuous functions that are bounded in L^(R), and whose 
Fourier transforms (as distributions) have support contained in [— Q/2,Q/2] for a known 
bandwidth ft. For example, speech signals can be modeled as bandlimited functions whose 
bandwidth is 4 KHz, and audio signals in general are well-modeled as bandlimited functions 
with bandwidth 20 KHz. For ease of presentation, let us fix O = 1 in the sequel; all of 
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our results can be extended to general bandwidths by change of variables. For the Fourier 
transform, we shall use the normalization 



/oo 
f{t)exp{-2TTiujt)dt (1) 
-OO 



for / G i^^(M), and extended to the space of tempered distributions in the usual way. 

For bandlimited functions /, the low frequency content of the sequence of samples (/(f ))^^^ 
is the function itself; therefore, such functions can be reconstructed using a low-pass filter. 
In mathematical terms, this intuition corresponds to the Shannon-Nyquist sampling theorem: 
if A > 1, then any function / G 0i can be recovered as a weighted sum of translates of an 
averaging kernel g e L^{m) via the formula 

where g is any bounded and continuous function whose Fourier transform ^ satisfies 

^, . r 1, if \oj\ < 1/2 

^^^^ = 1 0, if \u\ > Ao/2 
for some arbitrary Aq with A > Aq > 1. 



Quantization. Given a sequence of samples (/(f ))^^^ associated to a bandlimited function 
/ G a K-level quantization scheme assigns a sequence of quantized values from an 
alphabet Ajc of size \Ak\ = K in such a way that the function 

hit) = \j:in9{t-l) (4) 

serves as a good approximation to f{t) = ^ Z^^gZ /(x)^(^ ~ j)- ^ natural choice for the 
discrete coefficient q^ in Q is the truncated K-1 term binary expansion of the amplitude |/;^|, 
multiplied by the sign of - such binary quantization schemes are an industry standard for 
digitizing audio signals. However, binary quantization suffers from various implementation 
difficulties and disadvantages in practice, mostly related to the cost of building analog circuits 
that can carry out binary expansions accurately to many digits; in practice, it is not always 
the quantization scheme of choice among engineers. 

In many applications, oversampled coarse quantization is instead preferred, where a fixed 
number of levels — sometimes as few as two levels, q^ G { — 1,1} — is allocated to each 
sample at the more tolerated expense of very high sampling rate compared to the rate 
A ^ 1 which is sufficient for binary quantization. From the viewpoint of circuit engineering, 
oversampled coarse quantization is associated to low-cost analog hardware, because increasing 
the sampling rate is cheaper than refining the quantization. Further advantages of oversam- 
pled coarse quantization methods include a built-in redundancy and robustness against errors 
resulting from imperfections in the analog circuit implementation. This robustness comes as 
a consequence of the more 'democratic' distribution of bit significance in the reconstruction 
formula Q; we refer the reader to [3j for more details. 
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1.1 Sigma Delta quantization 

In sigma delta (SA) quantization, one of the most widely-used oversampled quantization 
methods in practice, is dynamically updated as a function of previous and in such a 
way that the frequency content of the quantization error is pushed to high frequencies; these 
high-frequency error components are then cancelled out by the convolution, Q, which acts 
as a low-pass filter. In mathematical terms, q^ being a high-pass sequence means that the 
difference — q^ is the mth order difference of a bounded auxiliary sequence Vn- Precisely, 
for an mth order SA quantization scheme, the sequence of coefficients q^ ^ A is chosen such 
that 

m /X 

- = (A-^). J2(-^yrW+rn-l. n G Z, (5) 

The quantization error associated to such quantization schemes may be bounded as follows: 

Proposition 1.1. Fix f ^ Bi, and furthermore suppose that \\f\\oo ^ <^ < 1 for some 
a < 1. Fix a filter g satisfying the assumptions and suppose further that g and its first 
m derivatives belong to L^{W). Fix an over sampling ratio A > Aq; and let A be a discrete 
alphabet consisting of equispaced values with endpoints +1 and —1. Suppose that there exists 
a sequence of coefficients q^ ^ A and a bounded state sequence {vn)'^=Q for which the mth 
order relation ^ is satisfied. Then one may set to = to(m. A) > sufficiently large that for 
all t > to, 

^ CO 

f{t)-jY.1n9{t-l)\<CX-^; (6) 

n=l 

the constant C appearing above depends only on \\v\\oo and on ||^||li(r)5 11^^"^^ IIli(R)5 •••11^^^^ IIli(r^ 

For a rigorous proof of Proposition we refer the reader to [4J . Note that in the error 
estimate Q, the index n ranges over the positive integers only; this change of setup yields 
no difficulties, given that a filter g satisfying ([s]) can be made to be well-localized in time by 
requiring that its Fourier transform be sufficiently smooth, see [lOj for more details. 

Remark 1.2. We shall refer to any quantization scheme satisfying an error estimate of the 
form ^ as an mth order quantization scheme. 

Construction of HA quantization schemes. We now turn to the issue of existence of 
quantization schemes satisfying the difference relation ([s]). Although high-order SA quantizers 
have been implemented in practice for many years [15], the construction of SA schemes of 
arbitrary order m > 1 for which the state sequence {vn)neN is guaranteed to remain bounded 
has been only recently achieved by Daubechies and DeVore in [4J: 

Proposition 1.3 (Daubechies and DeVore). Suppose that f , g, and A satisfy the assumptions 
of Proposition [77/} Take A > 1^ and consider a quantizer function Q : [—a, a] A that 
satisfies 

Q{u) — sign{u) for \u\ > 1/2. 
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Then there exist admissible functions F : R for which the recursion 



u, 




m 



(7) 



with initial conditions Uq = for j = l,...,m, generates a sequence {vn)neN '•— {"^n jneN 
that is bounded and satisfies the finite difference equation ^ . 

A few remarks are in order. 

1. The recursions Q are not the only means for generating coarse quantization schemes 
having mth order accuracy ([g]); therefore, we shall refer to them as the standard HA 
recursions. For a comprehensive overview on more general setup for coarse quantization, 
we refer the reader to [12j. 

2. In the case m = 1, one may take Vn-i) — fn + '^n-i as admissible function in Q, 
and the recursion reduces to 



In this case, one may verify the boundedness of {vn)n&^ by induction: if |/^| < 1 for all 
n > 1, and if \vo\ < 3/2, then \vn\ < 3/2 for all n > 0. 

3. For m = 2, the admissible functions of the form Fj{fn,Un-i,Vn-i) — + '^n-i 
guarantee boundedness of the sequence (vn)neN and \\v\\oo < Ca^ with the constant Ca 
depending only on |/^| < a < 1, for a range 7 = 7(0^) > 1; see [18] for more details. 

4. For higher orders m > 3, the only admissible functions F = that have been proven 
to guarantee boundedness of the {vn)neN foi" recursions of the form Q were constructed 
in [1]; they are defined recursively with respect to the order m, and become increasingly 
complicated with increasing order. Nevertheless, stability has been shown for a differ- 
ent class of recursions generating mth order quantization schemes, and the criteria for 
stability there are more aligned with the schemes implemented in practice [12j. Still, 
proving stability for a wider class of admissible functions for orders m > 3 remains a 
challenging open problem of interest to both mathematicians and engineers. 

1.2 Quiet SA quantization: defeating infinite memory 

As previously mentioned, the input signals in the context of analog-to-digital conversion are 
well-modelled as bandlimited functions. Nevertheless, audio signals in actuality have finite 
time support, completely vanishing over intervals of time such as between speech phrases 
or musical tracks, and ultimately, vanishing indefinitely. As bandlimited functions are re- 
strictions of analytic functions to the real line, such functions cannot vanish identically on 
an interval unless they are identically zero; nevertheless, bandlimited functions can become 
arbitrarily small over arbitrarily large intervals of time, and still provide a good model for 
audio signals. 



Vn = Vn-1 + fn- Q{Vn-l + fn)- 



(8) 
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For efficiency reasons, it is desirable that the quantization sequence {qn)neN^ in turn 
the state sequence {vn)neNi f^H to zero in response to vanishing input fn = f{j) — 0- For 
instance, as the quantization level = corresponds to zero-voltage level, quantization 
schemes as such can essentially 'shut off' over the off-support of the input, rendering them 
low-power. Indeed, there are devices that drive speakers with very high efficiency which use 
a three-level quantization alphabet Qn G { — 1, 0, 1}, dating back to a patent issued to Crystal 
Semiconductor, Inc. (now Cirrus logic), see [16j for more details. For efficiency reasons 
as explained in that patent, it is desirable to maximize the number of 'O's and minimize 
the number of +l's and — I's in the tri-level quantization output. In particular, devices 
which implement SA quantization with tri-level quantization are in widespread use in modern 
electronic technology [Ij, see the website of Analog Devices for more details. 

Remark 1.4. We shall refer to any quantization scheme for which the quantization sequence 
{Qn)neN^ and the state sequence {vn)neN ff^ll to zero at the onset of vanishing input fn^Oas 
quiet . 

Unfortunately, the quantization output produced by the standard SA schemes, Q is not 
quiet; in contrast, quantization sequences qn produced by such recursions generally fall into 
periodic cycles at the onset of vanishing input. The treatment of such zero-input periodicities, 
which can cause spurious idle tones in the reconstructed signal, has been an area of active 
research over the past twenty years, see the patent [13], and also the papers [5], [H] for 
more details. One of the methods used in practice to break such cycles is to apply dither, or 
random noise, to the sample input; note that mth-order accuracy in the sense of ^ is no 
longer obtained subject to the application of dither. 

Modifying the standard HA recursions to be quiet. Let us consider the task of mod- 
ifying the standard SA recursions ([7| to be quiet, or so that := {un\un \ ...^uIt^"^) and 
qn fall to zero in response to vanishing input, and doing so while still maintaining mth order 
accuracy of the resulting approximations. We shall restrict attention to quantization schemes 
using tri-level quantizers of the form Q{u) = Qtri{u)^ where 

r 1 u>l/2, 

Qtr^{u) = < -1/2 <U< 1/2, (9) 

[ -1 < -1/2. 

As discussed previously, tri-level quantizers are often implemented in quantization devices 
which are built primarily for high efficiency, which is the motivational setting for quiet quan- 
tization. 

Dynamical system interpretation. When fn — 0, the standard mth order recursion 
(7| reduces to an m-dimensional piecewise-affine map. Implemented with tri-level quantizer 
(9]), and assuming that —1/2 < F(0, 0) < 1/2, the state x = is a fixed point of this so- 
called zero-input map. Therefore, quietness is achieved if the standard recursions Q may be 
modified so that x = is an attractive fixed point for the zero-input map, at least within a 
neighborhood containing the bounded sequence (x^)^^i^, and if this can be done while still 
maintaining boundedness of the sequence {vn)n^n^ and mth order reconstruction accuracy ^ 
of the scheme. 
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Quietness for first-order SA quantization. Quietness for the first-order SA scheme 
m = 1, implemented with tri-level quantizer, has been previously studied previously in [18j . 
There, the author shows that quietness is achieved by composing the first-order recursion ([s]) 
with a contraction Vn pvn^ leading to the map Vn — pvn-i ^ fn — QiP'^n-i + fn)- Indeed, 
it is not hard to verify that 

1. the uniform bound \vn\ < 3/2 holds subject to this modification, 

2. the fixed point = is a globally attracting fixed point of the zero-input map Vn = 

pvn-i - Qtri{pvn-i), and 

3. the modified recursion Vn = pvn-i + fn — QtriiP'^n-i + fn) still represents a first-order 
scheme if the damping is such that p > px 1 — where A is the oversampling ratio. 
We refer the reader to [18j for a proof of these results. 



Quietness for second-order SA quantization. Unfortunately, first-order SA schemes 
are rarely used in practice, and we would like to obtain similar quiet modifications of the 
higher-order standard recursions ([t]). Second-order SA schemes are often preferred in practice 
for having improved reconstruction error over first-order schemes, while still maintaining 
stability and ease of implementation. For the remainder of this paper, we shall concentrate 
only the case m = 2; we leave the analysis of higher-order quiet HA schemes to future work. 

As mentioned previously, the simplest admissible functions for the second-order recursion 
Q are of the form F{u, v) = ju + v. For this case and for a range of 7 > 1 that depends on the 
parameter a in the bound < <^ < 1, the sequence {vn)neN generated by the second-order 
scheme ([7| is guaranteed to remain bounded. This is shown in by constructing a family 
of convex sets Sa C M? which are invariant under the recursion. Implemented with tri-level 
quantizer and linear rule F{u,v)=ju-\-v^ the second-order recursion Q has the form. 



(^0,^0) 



(0,0), 



-1, < -1/2 



qn 



0, 

1, 



F{Un-l,Vn-l] 



< 1/2 



F{Un-UVn-l) > 1/2, 
T{{Un-l, Vn-l), fn) (^n-1 + fn - 



-1 +Vn-1 + fn- qn)' (10) 



It is natural to wonder whether the damped modification (un^Vn) = T(p(un-i^Vn-i)^ fn) 
might induce quietness in this case as it did for the first-order recursion ([s]). Indeed, the 
behavior of such a 'leaky' modification has been studied extensively in its own right, for the 
behavior of its resulting periodic cycles [6J, and for its approximation accuracy [18j. With 
respect to the latter point, it has been shown that the boundedness of the state sequence 
{vn)neN is not affected by this modification, as the invariant sets for the map (un^i^Vn+i) = 
T{{un,Vn)i fn) ^trc convcx and contain the origin. Moreover, identifying Un — pun-i — — 
2pVn-i + p^Vn-2, one verifies that second-order accuracy is maintained for the leaky scheme 

at 



(Un.Vn) = T{p{Un- 



-i),/n) if the damping factor p is bounded below hy px — 1 
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each step: 

fit) -f it) = \T.(fn-qn)9{t-'^ 

n>l 



n>l 



?2>1 ?2>1 

(11) 



By Proposition [lT} the first term in the ultimate expression is bounded in magnitude by CA ^ 
for t > to (A) sufficiently large, and the second term is bounded in mag nitudeby CX-^{l-p) < 
CX~^. This is because the first term corresponds to the approximation error for a standard 
second-order SA recursion, and the second term corresponds to the approximation error 
incurred for a standard first-order SA recursion, multiplied by 2(1 — p). At the same time, 
the third term is proportional to (1 — p)^C = \~^C for a constant C depending on ||^||li(r) 
and 11'?^ I loo only. In total, we arrive at the desired second-oder bound: \f{t) — f{t)\ < C\~^ 
for t > to(A). 



The second-order leaky modification is not quiet. Unfortunately, despite retaining 
second-order accuracy, leaky modifications to the standard second-order recursion of the form 
{un^Vji) — T{p{un-i,Vn-i)^ fn) Still do not achicvc quietness, because the origin is not an 
attractive fixed point for the piecewise affine zero-input map {un^Vn) — T(pn{un-i^Vn-i)^0) 
in a sufficiently large neighborhood. For example, if uq = {1 + p)~^, vo — 0, and fi = 
(1 + p)~^ at the onset of zero input /2 = = 0,n > 2, the system falls into period-2 
oscillation. Nor are such period-2 trajectories 'pathological situations' in an otherwise well- 
behaved neighborhood of convergent trajectories; numerical results such as those in Figure [l] 
suggest that the neighborhood of attractivity for the piecewise-affine system (t^n+i, '^n+i) = 
T{p{un, Vn), 0) shrinks to a one-dimensional subset of as p ^ 1. 



2 The main result 

As it turns out, the situation changes completely if we apply damping, but not at every 
iteration, only at iterations n for which Un > (or the symmetric, when Un < 0). As the main 
contribution of this paper, we introduce the following asymmetrically- damped variant of the 
standard second-order DA recursion (10), 



{Un+l,Vn+l) = T({Un,Vn), fn) - 



T{p{Un,Vn)Jn 



>o, 
<o. 



(12) 



and we shall show that this modification is guaranteed to be quiet. As far as the author 
is aware, the asymmetric scheme (12) is the first such example of a coarse quantization 
scheme which obtains second-order approximation accuracy. Following the argument (11), 
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Figure 1: We indicate whether or not the trajectory of xq = (uq^vq) under the map (li^+i, '^n+i) 



T(^p{un^ Vn)^ O), as defined in (10), converges to zero, for 100 equispaced values of p G [.96, 1) and initial 
conditions (uq^O) in the interval uq G [—2,0]. The figure suggests that the neighborhood of attraction 
of the map shrinks to a one-dimensional set as p ^ 1, as the gray 'tornado' regions indicate initial 
values Uq for which, after one million iterations, the discrete output G { — 1,0,1} has a nontrivial 
period of fewer than 100 iterations that persists over an additional one million iterations. In contrast. 



the analogous plot for the asymmetric variant of this map, (12), produces no gray regions. 



this asymmetric scheme is second-order if p > 1 — ^. Moreover, as a consequence of the 
asymmetry of the damping, the Un and in turn the Vn, are forced to zero at the onset of 
zero input, no matter how small one sets the difference 1 — p, and independent of the initial 
conditions (uq^vq). 



Numerical Illustration. In Figure 5 ^a), we plot the approximation fs{t) to an identically 



zero signal /(t) = as produced by the standard second-order SA recursion with tri- level 
quantizer, ( [To| ). The idle tones produced by the periodicity in the output {qn)neN are easily 
visible as spikes^in the frequency domain of this reconstruction. In Figure |^[b)| we plot a 
reconstruction fq produced by the quiet scheme, (12); the spikes in the frequency domain 
have been clearly smeared out. 

The remainder of the paper shall be devoted to proving that the origin is a globally attracting 
fixed-point of the zero-input map (un+i^Vn+i) = r((i/,^, i;^), O) , as a consequence of the 
following theorem. 

Theorem 2.1. Consider the linear operator A : ^ given by A(u,v) = {u,u + v), the 
vectors 1 = (1, 1), c = (7, 1)^ and d = (1, 0)^ and the piecewise affine map 



Xr; 



Ax^ + 1, z/ (c,x^) < -1/2 
Ax^, if I (c,x^) I < 1/2 

Ax^-1, if (c,x^) > 1/2. 



(13) 



For any fixed amplification factor 7 > 1 and damping factor < p < 1, the origin is a globally 
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(a) We plot /s(t), an approximation to f{t) = using the standard second-oder scheme 
(10), with oversampUng rate A = 100 and parameter 7 = 2, and starting from initial 
conditions (uq^vq) — (.5, .3). This reconstruction is represented in time and also in the 
frequency domain. Spikes in the frequency domain, resulting from periodicities in the 
quantization output, can create idle tones in the reconstructed signal. 
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(b) We plot /g(t), an approximation to /(t) = using the introduced quiet second-oder 



scheme (12) with oversampling rate A = 100, parameter 7 = 2, and damping factor 
p — .99, and starting from the same initial condition {uq^vq) — (.5, .3), represented in 
timeand also in frequency. The spikes in the frequency domain have been smeared out. 



Figure 2: Comparison of the standard and asymmetrically-damped second-order HA quanti- 
zation schemes. 

attracting fixed point for the asymmetrically- damped piecewise affine map, 

M : ^ 



Xn+l 



Tx 

■J- -^n. 1 



(d,x„) >0, 
(d,x„) < 0. 



Let us pause to discuss an additional application of the main theorem, and of the quiet 



quantization scheme (12). 
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Finite Impulse Response filter coefficient quantization. The reconstruction formulae 
([2]) and Q for sampling and quantization of bandlimited signals are particular examples of 
discrete-time linear filters, or implementations of the convolution between an infinite-length 
input sequence (xn)neN and a set of coefficients (cn)neNi generating output of the form = 
X^jgN ^j^n-j- The filter is called a finite impulse response (FIR) filter if the set of coefficients 
is finite-length. The Fourier transform of the output sequence {yn)n=i (i^i the sense of the 
discrete-time Fourier transform) is the product of the Fourier transform of the input sequence 
with the Fourier tranform of the set of coefficients, y{(jj) = c{(jj) • x{(jj). 

In some cases, one would like to reduce the hardware complexity of FIR filters by quan- 
tizing the coefficients {cj)jLi using only a few bits for each coefficient. If one simply rounds 
each coefficient value to its nearest quantized level (binary quantization), then the Fourier 
transform of the set of coefficients may be changed dramatically and therefore the frequency 
response of the filter is no longer useful. However, in some cases it is only necessary to main- 
tain the filter's frequency response in lower frequencies, and the rest of the frequencies are 
not important. 

In this case, the original real-valued coefficients of the filter cj are often modified by 
sending them through a recursion such as the standard DA recursion Q, to obtain a new set 
of quantized coefficients qj, see [T4], [8]. The frequency response of the filter is maintained on 
lower frequencies subject to such quantization; to see this, let us analyze the Fourier transform 
of the quantization error: 

e{(jj) := (c{ljj) — q{(jj)) • x{uj) 

(AMi;)(cj) • x(cj) 
= (2^^)"^ 'v{uj)'x{uj). (14) 

The problem with this technique is that, while the number of input coefficients is finite, the 
quantization output cannot be stopped without causing a large error in the Fourier transform 
of the quantized coefficients, if the standard noise shaping recursions ^ are applied, [T]. 



However, if one employs instead the quiet noise-shaper, ( 12 ) , then as a consequence of Theorem 



(2.1), the tri-level quantization output is guaranteed to go to zero after the sequence of input 
coefficients has been exhausted. This means that the set of quantized coefficients is slightly 
longer than the original coefficient set, as one must wait for the noise-shaper to fall into the 
'all-zeroes' state, but nevertheless is finite. 



In words, the introduced quiet SA noise-shaper (12), which is guaranteed to be quiet as 



a result of Theorem 2.1, allows a way to quantize coefficients to 3 levels, such that a finite 
set of input coefficients can produce a finite set of output coefficients, such that the Fourier 
transform of the two sets of coefficients match at low frequencies, and diverge at higher 
frequencies. 



3 Proof of Theorem 12.1 



The remainder of the paper is devoted to the proof of Theorem 2.1, and is essentially disjoint 



from the material presented in previous sections. Let us recall that piecewise affine maps are 
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discrete dynamical systems of the form 



T : 

Xn+i = Tyin = Ajyin + bj, if G (15) 

where the sets form a finite partition of the domain R^, and Aj G W^^^. It has 

been recently shown [2j that the attractivity problem, or deciding whether or not the origin is 
a global attracting fixed point of a discrete map, is in general undecidable for piecewise affine 
maps in dimension m > 2. Consequently, there is no universal procedure which can decide, 
given a generic piecewise affine map in dimension m > 2, whether all trajectories converge to 
zero. In order to address the attractivity problem for a particular piecewise affine map, one 
must then either verify that certain sufficient conditions hold, or develop a convergence proof 
for a restricted subclass containing the map of interest. Sufficient conditions for attractivity 
generally involve verifying the existence of a Lyapunov function V : having negative 

forward difference along trajectories of the map, Ay(x^+i,x^) = V^(x^+i) — ^(x^) < 0. For 
example, a recent result in this direction, from [Tj, states that x = is a globally attracting 
fixed point of the piecewise affine map x^+i = r(x^) if there exists a function V : MJ^ 
of the form 

y(x)=x^P^-x, xG^-, P^- = Pj>0, (16) 

which has negative forward difference along trajectories Ay(x^+i,x^) < 0. The constraints 
defining Lyapunov functions of this form may be recast as a set of linear matrix inequalities, 

AjP,Aj-Pj < 0, 

Pi^P^ > 0, VzgL, (17) 

which must be checked over over all pairs (i, j) for which it is possible that x/^ G fti and 
^k+i ^ ^i+i- Linear matrix inequalities can be either solved or shown to be nonexistent 
using standard linear programming solvers and as such, the construction or nonexistence of a 



Lyapunov function of the form (16) for a particular piecewise affine map may be determined 
in polynomial time. 



Unfortunately, Lyapunov functions such as the piecewise positive-definite functions in (16) 
which may be tested systematically are very restrictive, and are not applicable to many piece- 
wise affine maps of interest which nevertheless have a global fixed point. For example, as the 



reader may check, a Lyapunov function if the form (16) does not exist for the asymmetric 



piecewise affine map x^+i = M(x^,0) of Theorem 2.1 which is of interest to us. Neverthe- 
less, we are still able to prove that the origin is an attractive fixed point of the asymmetric 
piecewise affine map; the proof is divided into two parts: 

1. In Section 3.1, we construct a invariant set S for the asymmetrically-damped map M, 
and show that all orbits initialized in this set converge to the origin. 

2. In Section 3.2, we show, using a Lyapunov function argument, that S is also a global 
trapping set for M. 

It is not clear that the proof need necessarily be split in two parts as such. The Lyapunov 
function constructed in Section 3.1 only decreases along orbits lying outside the trapping set 
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5, and so can not be used to prove convergence directly. However, there could exist a different 
Lyapunov function that does decrease along all orbits of the map M. Yet, numerical results 
such as those in Figure [4] show a marked change in the behavior of orbits upon entering the 
invariant region 5, suggesting an inherent two-phasedness to the dynamics of the system. 
We will use the following notation to distinguish the regions over which = 1, 0, and — 1, 
respectively, when > 0: 

= {{u, i;) G : p{-fu + v) > 1/2}, Aq = {{u, i;) G : p\-fu + v\ < 1/2}, 

A_i = {{u,v) G : pi^u^v) < -1/2}. 

3.1 An invariant set and convergence to the origin 

In this subsection, we construct a invariant set for the asymmetrically-damped map M of 
Theorem O 



Proposition 3.1. Consider the regions 

= {(u,v) gM^ : -1/2 <-fu + v < l/2 + 7,0<ix< 1}, anrf 
S- = {{u,v) G : -(1/2 + 7) < 7^ + < 1/2,-1 <u<0}, 

The union of these two regions, S = U S~ , as depicted in Figure 11 , is a invariant set for 
the map x^+i = Mx^. Furthermore, for G S: 

• qn — 1 if and only if ^ ci'^d x^+i ^ S~ , 

• (ln — —l 'if ctnd only if ^ S~ and x^+i G , and 

• // I = |^/c2 1 — 1 ^'^d Qk — for all ki <k <k2, then and qk2 must have alternate 
signs, i.e. qk^qk^ = -1- 

Proof. We begin by showing that 5 is a invariant set for the asymmetrically damped map 
x' = Mx. Being S the union of two convex sets, each containing the origin, x G 5 implies 
Ax G 5 for A G [0, 1]. As such, it is sufficient to show that S is invariant for the undamped 



map x' = Tx of Theorem 2.1 By symmetry of the set S and the map T, we can assume 



without loss that x G 5+. We consider the two cases x G Aq and x G Ai separately. 

1. (ix, v) G Ai: In this region, {u' ^ v') :— T{u^ v) — (i^ — 1, + — 1), and so in particular 
-l<u' < 0. Then x' G S' if 

-(1/2 + 7) <iu +v' < 1/2, 

which is easily verified from the inequalities 

1/2 < 7ix + i; < 1/2 + 7, and 0<u<l. 

2. (i/, v) G Aq: Now, {u\ v') — {u,v ^ u), and so < < 1. Also, we have that 

-1/2 < -fu +v < 1/2 + 7, 
using the inequalities —l/2<^u-\-v< 1/2, < < 1, and 7 > 1. 
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We proceed to the second part of the proposition. Suppose that (x^)^q is a trajectory 
contained entirely in 5, and that = = 1, but = for 1 < n < Because = 1, 
xo G and uq E (0, 1). For the same reason, uk-i E (0, 1). But ui = pUQ — 1 < 0, and 
so a power of p multiphed by ui^ must also be negative, leading to a contradiction. A 

similar argument rules out the possibility that qi = Qk — but = for 1 < n < □ 

Upon each return to the set S^, x^ is 'tilted' by the damping x px, creating an 
imbalance that forces the iterates to zero. 

Lemma 3.2. Suppose that (uq,vq) G S, so that {un,Vn) — M{uo,vo) G S for all n > by 

positive invariance of S. The subsequence {un)nex+ consisting of indices for which Un > 
necessarily converges to zero as n ^ oo. 

Proof. First observe that the event x^ G 5^, or equivalently Un > 0, must keep recurring, for 



if not, then = for all n according to Proposition 3.1 , and Vn = vo-\- {n-\-l)uo diverges. We 
may then assume that the index set X+ represents an infinite subset of the natural numbers, 
and {un)n^z+ infinite subsequence of Moreover, it is clear from the alternating 

sign pattern of the qn that Un^^i — punj along indices nj in X+, so that the subsequence 
"^uj — P^UjiQ converges to zero as rij ^ oc. □ 

Convergence of the subsequence {un)nex+ does not immediately guarantee convergence of the 
full sequence (un)nez+i the residual sequence {un)nez+\x+ could form an infinite subse- 
quence converging to — 1 as A; ^ oc. However, we can ensure that this pathological situation 
does not occur. 

Proposition 3.3. // xq G S, then x^ = M^xq eventually becomes trapped in C S. 
Moreover, ||x^|| ^0 as n ^ oo. 



Proof. As a consequence of Lemma 3.2, we may fix e > 0, and assume without loss that 



(i/o, '^o) ^ and that i^o ^ ^- We break the proof into two cases: 

1. Suppose first that qi — 0, so that 7^x9 + '^0 ^ 1/2. Then ui — puQ, vi = p(vo + uq), and 

p{-fUi-\-vi) = p'^{-fUo + vo) + p'^uo < + < 1/2, 
as long as e < (1 — p)/{2p^). Consequently, x^ G for all n, and 

x^ = p^A^yiQ = p^{ui^, uo + nvo). 
Since np'^ 0, it follows that ||x^|| 0. 

2. It remains to consider orbits satisfying x^ G if and only if = L Such trajectories 
are constrained as follows: 

(a) If x^ G 5^, then = 1, and 

Un+l pUn - 1 < pe - 1, 

Vn+l PVn + PUn - I < pVn + pe - 1 
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(b) If G S~ and Qn = 0, then x^+i G S~ , and 

i/^+l = Un < e - 1, 

(c) If E 5" and g^i = — 1, then Qn+i ^ {0, 1}, and 

Vn+1 = Vn + Un + l <Vn + e 

Since (c) cannot occur in successive iterations, we arrive at the period-2 inequahty 

<^^n + 2e-l, (18) 

indicating that the iterates Vn diverge. This case, then, cannot occur, and we conclude 
by Case 1 that ||x^|| 0. 

□ 

3.2 The invariant set aS as a global trapping set 

We show now that the invariant set S constructed in the last section is also a global trapping 
set for the asymmetric map; in light of Proposition 3, this guarantees that the origin is a 
globally attracting fixed point for the map M. 

Before proceeding, we will need the following general lemma: 

Lemma 3.4. Let S be a invariant set for a discrete map M on a set X. Suppose there exists 
a nonnegative function V : X ^ and a parameter S > with the property that for any 
X G X \ 5; either M^y. e S or V{M^x.) - ^ (x) < -6 after a finite time k. Then, S is a global 
trapping set for M. 

Proof Suppose V satisfies the hypotheses, and that x G X \ 5 is such that M^x ^ S for all 
k > 0. Let c = V^(x). From the stated hypotheses, V{M^^x) < c — 6 after some finite time fci, 
and, by induction, l/(M^^x) < c — nS after a finite time kn for any positive integer n. But 
then eventually y(M^x) < 0, which is impossible since > 0. □ 



Lyapunov functions of the form (16) presented in Proposition ?? do not exist for the map 



M, or even the symmetric map T, as we invite the reader to verify. Instead, we follow the 



approach in [19j where trapping sets for the second-order SA scheme (10) are constructed, in 
the slightly different setting where quantizer Q{u) —sign.{u) is considered instead of tri-level 
quantizer QtH^ and we consider the following Lyapunov function: 

V{u,v) = + \2v - u\. (19) 

The motivation for this V is as follows: letting V^{u,v) = + 2v — u and V~{u,v) = 
— 2v ^ it is easily verified that V{u^v) — max i;), i;)}. Also, and V~ 

are the unique functional solutions to the equations 



U,V] 
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As it turns out, the set of points x for which V may have positive forward difference under 
iteration of the map T, AV{Tx^ x) = V{Tx) — V{x) > 0, is contained in the invariant set S 



of Proposition 3.1 



Proposition 3.5. Consider the map T of Theorem 2J_, the convex function V : M? ^ 
given by V{u, v) = + \2v — u\, and the set i? = i?i U R2, with 

= {{u,v)^n^: 7i/ + i;>0, 2v^u<l, < 1/2}, 
i?2 = {{u,v)^M^: 7i/ + i;<0, 2v^u>-l, u>-l/2}. 

If AV{T:k^x.) > 0, then x G i?. Moreover R is contained in the invariant set S of Proposition 

EH 

We defer the proof of Proposition |3.5| which amounts to a straightforward case by case 
analysis, to Section 5. The set R is displayed in Figure [3) along with the invariant set S. 




Figure 3: The region {(u^v) : T(V{u^v)) — T{u^v) > 0} over which the Lyapunov function 
V{u, v) = ^ \2v — u\ may have positive forward difference under the map T is contained in 
i?, which is represented by light gray triangles, and is superimposed over the global trapping 
set S. The parameter used above is 7 = 5. 



With Proposition 3^ and Lemma 3^ in hand, we are now ready to prove the main result 
of this section. 

Proposition 3.6. The invariant set S is a global trapping set for the asymmetrically- damped 
map M. 

Proof For sufficiently small e > 0, the set {x : T^(x) < e} is contained in S. We will verify 
the conditions in Lemma 3.4 for S and V, using 6 := e(l — p) > 0. The proof is split into 
three cases. 
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1. Suppose first that x = (u^v) lies in the positive half plane u >0 where Mx = T{px.). 
If X is not in the invariant set S but x' = px is, then Mx = Tx' E 5 by invariance of S 
for T. If on the other hand px ^ S, then 

l/(Mx) - l/(x) = V{T{p^)) - l/(x) 



< V{px.) — V^(x), by Proposition 3^ 



< py(x) - y(x), by convexity of V and ^(0) = 

= -(i-p)y(x) 

< -e(l-p), as y(x) > e if X ^ 5 
= -6. 

Thus, if X is in the positive half plane but not in 5, then either Mx G 5 or V{Mx.) — 
l/(x) < -S. 

2. Suppose now that x is in the negative half plane, u < 0, and also in the set Aq \ S. We 
compare V^(x) and "\/(Mx) explicitly: 

V{u, v) =u'^ + \2v - u\, V{M{u, v)) = V{u, u + v) = + \2v + u\. 



By inspection of Proposition 3.1, u and v must have opposite signs in this region, > 1, 
and l^^l > 7 > 1. It follows that 

V{M{u,v)) -V{u,v) < -2. 



3. We have verified the assumptions of Lemma 3.4 for all x G M \ 5 whose trajectories 
are either eventually contained in S or in the right half plane or in the left half plane 
intersected with the region Aq. In fact, all x G can be described as such. Assume for 
purposes of contradiction that there exists a point x whose entire trajectory x/^ = M^x 
lies in A_i n {{u^v) : < 0}, so that qk = —1 for all k > 0. But then Uk — ^ k 
becomes arbitrarily large and positive for increasing A:, an obvious contradiction to the 
assumption that Uk < ^ for all k. This is a contradiction to the assumption that 
+ Vk < —1/2, and this case is rendered impossible. The same argument obviates 
the possibility that any trajectory x/^ = M^x lies entirely in Ai D {{u, v) : u < 0} . 

We conclude that after a finite number of iterations A:, either M^ G S' or y (Mx) — V^(x) < 
5. □ 



Theorem 2.1 follows from Proposition 3.6 and Proposition 3.3 



4 Conclusions and open problems 



The class of piecewise affine maps in Theorem |2.1| for which we are able to prove global 
convergence is very restrictive; indeed, numerical experiments suggest that attractivity holds 
over a much broader class of piecewise affine asymmetric maps. Interestingly, the most natural 



generalizations of Theorem 2.1 from a theoretical perspective are also physically meaningful 
in the context of SA quantization. For instance, the symmetric piecewise affine map T of 
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Figure 4: Different niagnifications of an orbit of the map M for p = .98 and 7 = 5. The initial 
point {uq^vq) = (—3.4, 12.7) can be seen in the top image; this point and the first few iterations are 
outside the trapping set S (gray parallelograms). Once trapped in 5, the iterates (un^Vn) converge to 
the globally attracting fixed point (0,0). 



Theorem |2.1| corresponding to the original second-order HA scheme has three affine regions, 
corresponding to g = — 1, 0, and 1, while the asymmetric map M that is constructed to have a 
global attracting fixed point has six affine regions. Numerical results suggest that six regions 
are not necessary, and that attractivity holds for asymmetric maps consisting of three affine 
regions only including maps of the form, 

{Ax^ + 1, if (c,x^) < -T 
Apx^, if I (c,x^) I < T (20) 
Ap:sLri - 1, if (c,x^) > T. 

Note that the value r = 1/2 is no longer constrained; this is because global attractivity of the 
origin seems to independent of the particular value of r > 0. This is important in practice 
if the recursion is to be built into an analog circuit, where thermal fluctuations and other 
non-idealities cause fluctuations in all of the analog circuit components. 

We conjecture that the origin is also a globally attracting fixed point for four-level asym- 
metric maps of the form, 

where pi,p2,5i,52 are positive parameters less than or equal to 1, with 62 being strictly less 



than one. This more general framework is a natural extension of Theorem |2.1| and also serves 
as a more realistic model for SA quantization when implemented in analog circuitry, where, 
after one clock time, a small amount of integrator leakage on each of the two delay elements 
required to hold each of the states Un and Vn is unavoidable. Integrator leakage has the effect 
of reducing the stored input in the first delay to a fraction p of its original value, and reducing 
the stored input in the second delay by a fraction 6 of its original value; in most circuits of 
interest, (p, 6) G [.95, 1]^, but the specific leakage factors within this window are generally un- 
known and may vary slowly in time (see [9J, p. 485, and also [17j). The model ( [2T| ) also allows 
for the possibility of inducing leakage on the second delay element only, (i/^, Vn) {un^ P'^n)^ 
thus simplifying the recursion. 
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The class of four-level maps (21 ) are more complex than the three-level maps (20), but exhibit 
a much faster speed of convergence of trajectories to the origin. Indeed, the issue of speed of 
the convergence is a very important question in itself which we have not yet addressed. For 



the asymmetric map M in Theorem |2.1| the rate of convergence of trajectories to the fixed 
point X = 0, considered as a function of the discrete time n, will invariably decrease as p ^ 1. 
However, it is not clear whether this rate may be constant as a function of the sampling rate 
A, when the input fn — f{j) represent samples of a band-limited function, and p = 1 — ^. 
This is an interesting direction for future work. 

At this point, let us step back to the original motivation for this work, which was to suppress 
periodicities and quasiperiodicities in the discrete output sequence Qn in the SA recursion 
(10) at the onset of stretches of low-amplitude |/;^| << 1. Our assumption that f^^O over 
a segment of time is only an idealization of this situation, and the appropriate generalization 
of 'quietness' for general low-amplitude input is not immediately clear. However, at the onset 
of zero-mean input having sufficiently small amplitude and sufficiently high frequency of os- 
cillation, we conjecture that Qn = remains a 'fixed state'; this situation occurs for instance 
at the onset of vanishing input subject to additive noise. 

Finally, it would be interesting to generalize our results to higher dimensional piecewise affine 
maps, such as the maps corresponding to higher order SA recursions, ([7|. 



5 Proof of Proposition 3.5 



In this section we prove Proposition |3.5| showing that the region where Lyapunov function 
V{u^v) = i/^ + \2v — u\ has positive forward difference is contained in the set R — RiVJ R2 
defined by 

R^ = {{u,v) \ 7i/ + i;>0, 2v^u<l, < 1/2}, 

i?2 = {{u,v) \ 7i/ + i;<0, 2v^u>-l, > -1/2}. 

Proof. That R is contained in the trapping set S is straightforward. Regarding the first part 
of the proposition, consider the partition of R into Ai = {{u^v) G : 7ix + > 1/2}, 
A+ = {{u,v) G : < 7ix + < 1/2}, K~ = {{u,v) G : -1/2 < 7^/ + < 0}, and 
A_i = {{u,v) G : 7^/ + < -1/2}. Suppose that {u,v) G Ai \ so that {u',v') = 
T{u,v) = {u — l,u ^ V — 1). Our first aim is to show that V{T{u,v)) < V{u,v) in this 
situation. 

1. Case 1: U2v-u > 0, thenV(u,v) = u'^^2v-u, whileV{u\v') = u'^-2u+l^\u+2v-l\, 
so 

V{u\ v) < V{u, v) \u + 2v -1\ <u^2v -1 u^2v>l. 

But since {u^ v) ^ Ai \ we know that u > 1/2, so u + 2v > 2u > 1, and V{u', v') < 
V{u,v) holds in this case. 

2. Case 2: U 2v — u < 0, then V{u, v) — v? ^ u — 2v, while the expression for V{u\ v') 
remains unchanged; thus, 

V{u, v) < V{u, v) \u + 2v -1\ <3u-2v -1. (22) 
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We split this case into two subcases: 

(a) Case 2(a): If, on the other hand, u -\- 2v > 1^ then \u^2v — 1\ = + 2^; — 1, and 
( [22| ) simphfies to 

V{u', v') < V{u, v) u + 2v -1 <3u-2v -1 2v < u. 

But since 2v — u <{)hy assumption, the result holds in this subcase. 

(b) Case 2(b): liu + 2v<l, then 

V{u, v) < V{u, v) -u-2v + l<3u-2v-l 

u>l/2. (23) 

But of course, the condition u > 1/2 holds throughout Ai \ 

We have shown thus far that V{u\ v') < V{u, v) if {u, v) G Ai \ It remains to show that 

V{u,u v) < V{u,v) if (u,v) G Aq \ By inspection of Figure 4, this region consists of 

two disjoint sets: Pi : {{u^v) : < + v < l/2^u -\- 2v > < 0}, and P2 : {{u^v) : < 
-iu^v < l/2,u^2v <-l,u> 0}. 

1. Case 1: (ix, v) ^ Pi\ As Pi C {(^x, i;) : < 0, > 0}, the restriction 2v — u>{) trivially 
holds, and so h{u^ v) — v? ^2v — and 

h{u, u + v) < h{u^ v) <^=4> + \2{u + v) — u\ <y? + 2v — u 

\2v ^ u\ < 2v — u 
2v ^ u < 2v — u 
u<0 

which is satisfied by assumption. 

2. Case 2: {u^v) G P2 • ^2 is contained in the quadrant {{u^v) : u > O^v < 0}, and so 

2v — u < 0, h{u, v) — V? — 2v ^ and 

h{u, u + v) < h{u, v) <^=4> |i/ + 2i;| < u — 2v —{u + 2v) < u — 2v u > 0^ 

which again is satisfied by assumption. 

By symmetry of the set R and the map T, the symmetric result, that V{T(u^ v)) < V(u^ v) if 
{u, v) G A_i \ i?2 5 also holds. 

□ 
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